AIbase
Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
Multimodal JSON Output

# Multimodal JSON Output

Otpensource Vision
A vision-language model trained based on Bllossom/llama-3.2-Korean-Bllossom-AICA-5B, supporting Korean and English, specializing in image-to-text and text classification tasks in the fashion domain.
Image-to-Text Transformers Supports Multiple Languages
O
hateslopacademy
14
1
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
English简体中文繁體中文にほんご
© 2025AIbase